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(57) ABSTRACT 

Transforms are used for transcoding input text, audio and/or 
video input to provide a choice of text, audio and/or video 
output. Transcoding may be performed at a system operated 
by the communications originator, an intermediate transfer 
point in the communications path, and/or at one or more 
system(s) operated by the recipients). Transcoding of the 
communications input, particular voice and image portions, 
may be employed to alter identifying characteristics to 
create an avatar for a user originating the communications 
input. 
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DYNAMIC DESTINATION-DETERMINED 
MULTIMEDIA AVATARS FOR INTERACTIVE 
ON-LINE COMMUNICATIONS 

BACKGROUND OF THE INVENTION 

1. Technical Field 

The present invention generally relates to interactive 
communications between users and in particular to altering 
identifying attributes of a participant during interactive 
communications. Still more particularly, the present inven- 
tion relates to altering identifying audio and/or video 
attributes of a participant during interactive 
communications, whether textual, audio or motion video. 

2. Description of the Related Art 

Individuals use aliases or "screen names" in chat rooms 
and instant messaging rather than their real name for a 
variety of reasons, not the least of which is security. An 
avatar, an identity assumed by a person, may also be used in 
chat rooms or instant messaging applications. While an alias 
typically has little depth and is usually limited to a name, an 
avatar may include many other attributes such as physical 
description (including gender), interests, hobbies, etc. for 
which the user provides inaccurate information in order to 
create an alternate identity. 

As available communications bandwidth and processing 
power increases while compression/transmission techniques 
simultaneously improve, the text-based communications 
employed in chat rooms and instant messaging is likely to be 
enhanced and possibly replaced by voice or auditory com- 
munications or by video communications. Audio and video 
communications over the Internet are already being 
employed to some extent for chat rooms, particularly those 
providing adult-oriented content, and for Internet telephony. 
"Web" motion video cameras and video cards are becoming 
cheaper, as are audio cards with microphones, so the move- 
ment to audio and video communications over the Internet 
is likely to expand rapidly. 

For technical, security, and aesthetic reasons, a need exists 
to allow users control over the attributes of audio and/or 
video communications. It would also be desirable to allow 
user control over identifying attributes of audio and video 
communications to create avatars substituting for the user. 

SUMMARY OF THE INVENTION 

It is therefore one object of the present invention to 
improve interactive communications between users. 

It is another object of the present invention to alter 
identifying attributes of a participant during interactive 
communications. 

It is yet another object of the present invention to alter 
identifying audio and/or video attributes of a participant 
during interactive communications, whether textual, audio 
or motion video. 

The foregoing objects are achieved as is now described. 
Transforms are used for transcoding input text, audio and/or 
video input to provide a choice of text, audio and/or video 
output. Transcoding may be performed at a system operated 
by the communications originator, an intermediate transfer 
point in the communications path, and/or at one or more 
system(s) operated by the recipient(s). Transcoding of the 
communications input, particular voice and image portions, 
may be employed to alter identifying characteristics to 
create an avatar for a user originating the communications 
input. 

The above as well as additional objectives, features, and 
advantages of the present invention will become apparent in 
the following detailed written description. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The novel features believed characteristic of the invention 
are set forth in the appended claims. The invention itself 
however, as well as a preferred mode of use, further objects 
and advantages thereof, will best be understood by reference 
to the following detailed description of an illustrative 
embodiment when read in conjunction with the accompa- 
nying drawings, wherein: 

FIG. 1 depicts a data processing system network in which 
a preferred embodiment of the present invention may be 
implemented; 

FIGS. 2A-2C are block diagrams of a system for provid- 
ing communications avatars in accordance with a preferred 
embodiment of the present invention; 

FIG. 3 depicts a block diagram of communications 
transcoding among multiple clients in accordance with a 
preferred embodiment of the present invention; 

FIG. 4 is a block diagram of serial and parallel commu- 
nications transcoding in accordance with a preferred 
embodiment of the present invention; and 

FIG. 5 depicts a high level flow chart for a process of 
transcoding communications content to create avatars in 
accordance with a preferred embodiment of the present 
invention. 

DETAILED DESCRIPTION OF THE 
PREFERRED EMBODIMENT 

With reference now to the figures, and in particular with 
reference to FIG. 1, a data processing system network in 
which a preferred embodiment of the present invention may 
be implemented is depicted. Data processing system net- 
work 100 includes at least two client systems 102 and 104 
and a communications server 106 communicating via the 
Internet 108 in accordance with the known art. Accordingly, 
clients 102 and 104 and server 106 communicate utilizing 
HyperText Transfer Protocol (HTTP) data transactions and 
may exchange HyperText Markup Language (HTML) 
documents, Java applications or applets, and the like. 

Communications server 106 provides "direct" communi- 
cations between clients 102 and 104 — that is, the content 
received from one client is transmitted directly to the other 
client without "publishing" the content or requiring the 
receiving client to request the content. Communications 
server 106 may host a chat facility or an instant messaging 
facility or may simply be an electronic mail server. Content 
may be simultaneously multicast to a significant number of 
clients by communications server 106, as in the case of a 
chat room. Communications server 106 enables clients 102 
and 104 to communicate, either interactively in real time or 
serially over a period of time, through the medium of text, 
audio, video or any combination of the three forms. 

Referring to FIGS. 2 A through 2C, block diagrams of a 
system for providing communications avatars in accordance 
with a preferred embodiment of the present invention are 
illustrated. The exemplary embodiment, which relates to a 
chat room implementation, is provided for the purposes of 
explaining the invention and is not intended to imply any 
limitation. System 200 as illustrated in FIG. 2A includes 
browsers with chat clients 202 and 204 executing within 
clients 102 and 104, respectively, and a chat server 206 
executing within communications server 106. Communica- 
tions input received from chat clients 202 and 204 by chat 
server 206 is multicast by chat server 206 to all participating 
users, including clients 202 and 204 and other users. 

In the present invention, system 200 includes transcoders 
20S for converting communications input into a desired 
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communications output format. Transcoders 208 alter prop- speech-to-speech transcoding may be employed, to generate 

erties of the communications input received from one of a particular accent or age/gender characteristics. The 

clients 202 and 204 to match the originator's specifications receiver may also retain rights to adjust speed, volume, and 

210 and also to match the receiver's specifications 212. tonal controls in keeping with basic sound system manipu- 

Because communications capabilities may vary (i.e., com- 5 lations (e.g. bass, treble, midrange). 

munications access bandwidth may effectively preclude Text-to-text transcoding may involve translation from one 

receipt of audio or video), transcoders provide a full range language to another. Translation of text between languages 

of conversions as illustrated in Table I: is currently possible, and may be applied to input text 

converted on the fly during transmission. Additionally, text- 

TABLE I 10 to-text conversion may be required as an intermediate step 

in audio-to-audio transcoding between languages, as 
described in further detail below. 

Audio-to-video and text-to-video transcoding may 
involve computer generated and controlled video images, 
j5 such as anime (animated cartoon or caricature images) or 
even realistic depictions. Text or spoken commands (e.g., 

Through audio-to-audio (speech-to-speech) transcoding, ^Z™*" u or would cause generated images to 

the speech originator is provided with control over the basic P erform . the corresponding action. 

presentation of their speech content to a receiver, although For video-to-audio and video-to-text transcoding, origin 
the receiver may retain the capability to adjust speed, 2 o video tv P icallv includes audio (for example, within the 
volume and tonal controls in keeping with basic sound well-known layer 3 of the Motion Pictures Expert Group 
system manipulations (e.g. bass, treble, midrange). Intelli- specification, more commonly referred to as "MP3"). For 
gent speech-to-speech transforms alter identifying speech video-to-audio transcoding, simple extraction of the audio 
characteristics and patterns to provide an avatar (alternative portion maybe performed, or the audio track may also be 
identity) to the speaker. Natural speech recognition is uti- 25 transcoded for utilizing the audio-to -audio transcoding tech- 
lized for input, which is contextually mapped to output. As niques described above. For video-to-text transcoding, the 
available processing power increases and natural speech audio track may be extracted and transcribed utilizing audio- 
recognition techniques improve, other controls may be pro- to-text coding techniques described above, 
vided such as contextual mapping of speech input to a Video-to -video transcoding may involve simple digital 
different speech characteristics — such as adding, removing 30 filtering (e.g., to change hair color) or more complicated 
or changing an accent (e.g., changing a Southern U.S. accent conversions of video input to corresponding computer gen- 
to a British accent), changing a child's voice to an adult's or erated and controlled video images described above in 
vice versa, and changing a male voice to a female voice or connection with audio-to-video and text-to-video transcod- 
vice versa — or to a different speech pattern (e.g., changing ing. 

a New Yorker's speech pattern to a Londoner's speech 35 In the present invention, communication input and recep- 

pattern). tion modes are viewed as independent. While the originator 

For audio-to-text transcoding the originator controls the may transmit video (and embedded audio) communications 

manner in which their speech is interpreted by a dictation input, the receiver may lack the ability to effectively receive 

program, including, for example, recognition of tonal either video or audio. Chat server 206 thus identifies the 

changes or emphasis on a word or phrase which is then 40 input and reception modes, and employs transcoders 208 as 

placed in boldface, italics or underlined in the transcribed appropriate. Upon "entry" (logon) to a chat room, partici- 

text, and substantial increases in volume resulting in the text pants such as clients 202 and 204 designate both the input 

being transcribed in all capital characters. Additionally, and reception modes for their participation, which may be 

intelligent speech to text transforms would transcode state- identical or different (i.e., both send and receive video, or 

ments or commands to text shorthand, subtext or "emoti- 45 send text and receive video). Server 206 determines which 

con". Subtext generally involves delimited words conveying transcoding techniques described above are required for all 

an action (e.g., "<grin>") within typed text. Emoticons input modes and all reception modes. When input is 

utilize various combinations of characters to convey emo- received, server 206 invokes the appropriate transcoders 208 

tions or corresponding facial expressions or actions. and multicasts the transcoded content to the appropriate 

Examples include; :) or :-) or :-D or d;*) for smiles, :(for a 50 receivers. 

frown, ;-) or; -D for a wink; -P for a "raspberry" (sticking With reference now to FIG. 3, a block diagram of com- 
out tongue), and :-|, :-> or :-x for miscellaneous exprcs- munications transcoding among multiple clients in acces- 
sions; With speecb-to-text transcoding in the present dance with a preferred embodiment of the present invention 
invention, if the originator desired to present a smile to the is depicted. Chat server 206 utilizes transcoders 208 to 
receiver, the user might state "big smile", which the 55 transform communications input as necessary for multicast- 
transcoder would recognize as an emoticon command and ing to all participants. In the example depicted, four clients 
generate the text ":-D". Similarly, a user stating "frown" 302, 304, 306 and 308 are currently participating in the 
would result in the text string ":-(" within the transcribed active chat session. Client A 302 specifies text-based input 
text. to chat server 206, and desires to receive content in text 
For text-to-audio transcoding, the user is provided with 60 form. Client B 304 specifies audio input to chat server 206, 
control over the initial presentation of speech to the receiver. and desires to receive content in both text and audio forms. 
Text-to-audio transcoding is essentially the reverse of audio- Client C 306 specifies text-based input to chat server 206, 
to-text transcoding in that text entered in all capital letters and desires to receive content in video mode. Client D 308 
would be converted to increased volume on the receiving specifies video input to chat server 206, and desires to 
end. Additionally, short hand chat symbols (emoticons) 65 receive content in both text and video modes, 
would convert to appropriate sounds (e.g., ":-P" would Under the circumstances described, chat server 206, upon 
convert to a raspberry sound). Additionally, some aspects of receiving text input from client A 302, must perform text- 
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to-audio and text-to-video transcoding on the received input, 
then multicast the transcoded text form of the input content 
to client A 302, client B 304, and client D 308, transmit the 
transcoded audio mode content to client B 308, and multi- 
cast the transcoded video mode content to client C 306 and 5 
client D 308. Similarly, upon receiving video mode input 
from client D 308, server 206 must initiate at least video- 
to-text and video-to-audio transcoding, and perhaps video - 
to- video transcoding, then multicast the transcoded text 
mode content to client A 302, client B 304, and client D 308, 1Q 
transmit the transcoded audio mode content to client B 308, 
and multicast the (transcoded) video mode content to client 
C 306 and client D 308. 

Referring back to FIG. 2A, transcoders 206 may be 
employed serially or in parallel on input content. FIG, 4 1S 
depicts serial transcoding of audio mode input to obtain 
video mode content, using audio-to-text transcoder 208a to 
obtain intermediate text mode content and text-to-video 
transcoder 208b to obtain video mode content. FIG. 4 also 
depicts parallel transcoding of the audio input utilizing 20 
audio-to-audio transcoder 208c to alter identifying charac- 
teristics of the audio content. The transcoded audio is 
recombined with the computer-generated video to achieve 
the desired output. 

By specifying the manner in which input is to be 2 5 
transcoded for all three output forms (text, audio and video), 
a user participating in a chat session on chat server 206 may 
create avatars for their audio and video representations. It 
should be noted, however, that the processing requirements 
for generating these avatars through transcoding as 30 
described above could overload a server. Accordingly, as 
shown in FIG. 2B and 2C, some or all of the transcoding 
required to maintain an avatar for the user may be trans- 
ferred to the client systems 102 and 104 through the use of 
client-based transcoders 214. Transcoders 214 may be 35 
capable of performing all of the A different types of 
transcoding described above prior to transmitting content to 
chat server 206 for multicasting as appropriate. The elimi- 
nation of transcoders 208 at the server 106 may be appro- 
priate where, for example, content is received and transmit- 40 
ted in all three modes (text, audio and video) to all 
participants, which selectively utilize one or more modes of 
the content. Retention of server transcoders 208 may be 
appropriate, however, where different participants have dif- 
ferent capabilities (i.e., one or more participants can not 45 
receive video transmitted without corresponding transcoded 
text by another participant). 

With reference now to FIG. 5, a high level flow chart for 
a process of transcoding communications content to create 
avatars in accordance with a preferred embodiment of the 50 
present invention is depicted. The process begins at step 502, 
which depicts content being received for transmission to one 
or more intended recipients. The process passes first to step 
504, which illustrates determining the input mode(s) (text, 
speech or video) of the received content. 5s 

If the content was received in at least text-based form, the 
process proceeds to step 506, which depicts a determination 
of the desired output mode(s) in which the content is to be 
transmitted to the recipient. If the content is to be transmitted 
in at least text form, the process then proceeds to step 508, 60 
which illustrates text-to-text transcoding of the received 
content. If the content is to be transmitted in at least audio 
form, the process then proceeds to step 510, which depicts 
text-to-audio transcoding of the received content. If Dent, 
the content is to be transmitted in at least video form, the 65 
process then proceeds to step 512, which illustrates text-to- 
video transcoding of the received content. 
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Referring back to step 504, if the received content is 
received in at least audio mode, the process proceeds to step 
514, which depicts a determination of the desired output 
mode(s) in which the content is to be transmitted to the 
recipient. If the content is to be transmitted in at least text 
form, the process then proceeds to step 516, which illustrates 
audio-to-text transcoding of the received content. If the 
content is to be transmitted in at least audio form, the process 
then proceeds to step 518, which depicts audio-to-audio 
transcoding of the received content. If the content is to be 
transmitted in at least video form, the process then proceeds 
to step 520, which illustrates audio-to-video transcoding of 
the received content. 

Referring again to step 504, if the received content is 
received in at least video mode, the process proceeds to step 
522, which depicts a determination of the desired output 
mode(s) in which the content is to be transmitted to the 
recipient. If the content is to be transmitted in at least text 
form, the process then proceeds to step 524, which illustrates 
video-to-text transcoding of the received content. If the 
content is to be transmitted in at least audio form, the process 
then proceeds to step 526, which depicts video-to-audio 
transcoding of the received content. If the content is to be 
transmitted in at least video form, the process then proceeds 
to step 528, which illustrates video- to-video transcoding of 
the received content. 

From any of steps 508, 510, 512, 516, 518, 520, 524, 526, 
or 528, the process passes to step 530, which depicts the 
process becoming idle until content is once again received 
for transmission. The process may proceed down several of 
the paths depicted in parallel, as where content is received 
in both text and audio modes (as where dictated input has 
previously been transcribed) or is desired in both video and 
text mode (for display with the text as "subtitles"). 
Additionally, multiple passes through the process depicted 
may be employed during the course of transmission of the 
content to the final destination. 

The present invention provides three points for control- 
ling communications over the Internet: the sender, an inter- 
mediate server, and the receiver. At each point, transforms 
may modify the communications according to the transcod- 
ers available to each. Communications between the sender 
and receiver provide two sets of modifiers which may be 
applied to the communications content, and introduction of 
an intermediate server increases the number of combinations 
of transcoding which may be performed. Additionally, for 
senders and receivers that do not have any transcoding 
capability, the intermediate server provides the resources to 
modify and control the communications. Whether per- 
formed by the sender or the intermediate server, however, 
transcoding may be utilized to create an avatar for the 
sender. 

It is important to note that while the present invention has 
been described in the context of a fully functional data 
processing system and/or network, those skilled in the art 
will appreciate that the mechanism of the present invention 
is capable of being distributed in the form of a computer 
usable medium of instructions in a variety of forms, and that 
the present invention applies equally regardless of the par- 
ticular type of signal bearing medium used to actually carry 
out the distribution. Examples of computer usable mediums 
include: nonvolatile, hard-coded type mediums such as read 
only memories (ROMs) or erasable, electrically program- 
mable read only memories (EEPROMs), recordable type 
mediums such as floppy disks, hard disk drives and 
CD-ROMs, and transmission type mediums such as digital 
and analog communication links. 
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While the invention has been particularly shown and destination utilizing a transcoder selected from the 
described with reference to a preferred embodiment, it will group consisting of a text-to-text transcoder, a text-to- 
be understood by those skilled in the art that various changes audio transcoder, a text-to-video transcoder, an audio- 
in form and detail may be made therein without departing to-text transcoder, an audio-to-audio transcoder, an 
from the spirit and scope of the invention. s audio-to-video transcoder, a video-to-text transcoder, a 

What is claimed is: video-to-audio transcoder, and a video-to-video 

1. A method for controlling communications, comprising: transcoder. 

receiving communications content and determining a text, 9 * The system of claim 8, wherein the means for transcod- 

audio, or video input mode of the content; in S the content from the text, audio, or video input mode to 

determining a user-specified text, audio, or video output 10 the. user-specified text, audio or video output mode prior to 

mode for the content for delivering the content to a delivering the content to the destination further comprises: 

destination- and means for transcoding the content at a system at which the 

transcoding the content from the text, audio, or video < ^ C °™ ent * S "^^Jf received. . 

, , °, tn _ , ' j* _ • 10. The system of claim 8, wherein the means for 

input mode to the user-specified text, audio, or video / A , *■ .. . 

. t . t j r • + . . * * 15 transcodmg the content from the text, audio, or video input 

output mode prior to delivering the content to the . . & , ,. ' . A r , 

. t - . . * . i * j f mode to the user-specined text, audio, or video output mode 

destination utilizing a transcoder selected from the . , . , ! . ■ * 

j * * * pnor to delivering the content to the destination further 

group consisting of a text-to-text transcoder, a text-to- p . & 

audio transcoder, a text-to-video transcoder, an audio- comprises. 

to-text transcoder, an audio -to-audio transcoder, an means for transcoding the content at a system intermedi- 

audio-to-video transcoder, a video-to-text transcoder, a 20 ate to a svstem at wnich the mn * ni is mUiall y rece,ved 

video-to-audio transcoder, and a video-to-video and a s y stem t0 which the c °ntent is delivered, 

transcoder s y stem °f claim 8, wherein the means for 

2. The method of claim 1, wherein the step of transcoding transcoding the content from the text, audio, or video input 
the content from the text, audio, or video input mode to the „ mode t0 ih& user-specified text, audio, or video output mode 
user-specified text, audio, or video output mode prior to 25 P rior 10 delivering the content to the destination further 
delivering the content to the destination further comprises: comprises: 

transcoding the content at a system at which the content means for transcoding the content at a system to which the 

is initially received content is delivered. 

3. The method of claim 1, wherein the step of transcoding 30 12 ™ e ^» tenJ of f claim «- where «! ,he m f ans fo ' 
the content from the text, audio, or video input mode to the transcoding the content from the text, audio, or video input 
user-specified text, audio, or video output mode prior to m ? de t0 the user-spe«fied text, aud.o or video output mode 
delivering the content to the destination further comprises: P nor t0 delivc ™g the cont ent to the destination further 

,. * j • a a comprises: 

transcoding the content at a system intermediate to a _ . c • 

* l- u **■•*.• 11 * ,1 a means for creating an avatar for an originator or the 

system at which the content is initially received and a 35 , , . u & . . . ~ . c 4 , 

3 t , u u*u ♦ ♦ ■ -1 i- j content by altenng identifying characterisUcs of the 

system to which the content is delivered 3 & 3 & 



4. The method of claim 1, wherein the step of transcoding „ fc * 1 ■ 1 * i_ • .u c 

t , . - . . 1 • > j . . , , * 13. The system of claim 12, wherein the means for 
the content from the text, audio, or video input mode to the . t * ■ • / f< , ♦ .u 1. • 

• c , 4 . j. j . . j - creating an avatar for an originator of the content by altenng 

user-specified text, audio, or video output mode pnor to t .~ & . . . %. t t t 3 . & 

1 , . . : ' . ' . K. v . identifying characteristics of the content further comprises: 

delivering the content to the destination further comprises: 40 ? , . , , . ■ c , . . 

....... . . means for altenng speech characteristics of the originator. 

transcoding the content at a system to which the content u ^ system B o£ " claim 12> wherein , he means for 

* ^ 6 1V fu C j' c 1 ^ u • , j- creating an avatar for an originator of the content by altering 

5. The method of claim 1, wherein the step of transcoding .« , .... T L , r J 0 
in. ii.uuuu ul v aiui x, uvivu. u. & identifying characteristics of the content further comprises: 

the content from the text, audio, or video input mode to the J ~ . , , . , r r . 

user-specified text, audio, or video output mode prior to 45 ^eans for altering pitch, tone, bass or mid-nnge of the 

delivering the content to the destination further comprises: ^ en ' , 

, 15. A computer program product within a computer 

creating an avatar for an originator of the content by usab , e medjum for controlling communications, compris- 
altering identifying characteristics of the content. . 

6. The method of claim 5, wherein the step of creating an . . - . . ... 
t c . . t r#u . 1 1 i- ■ • i < r . en instructions for receiving communications content and 

avatar for an onginator of the content by altering identifying 5U . ... . . flL t 4 

. t . t . % .u . * t *u • deter a text, audio, or video input mode of the content; 

charactenstics of the content further comprises: . . ' 1 . , r 

. . , , . t . - JL ... instructions for determining a user-specified text, audio, 

altenng speech characteristics of the onginator. ., . t , c °.. . r t c , . 4 . 

m rp, iL ,r,. e i. . t r <■ or video output mode for the content for delivering the 

7. I ne method of claim 5, wherein the step of creating an « . . j j 
A c ... c i_ i . r . .. .-r • content to a destination; and 

avatar for an onginator of the content by altering identifying . . - ,. . 

characteristics of the content further comprises: 55 '-^ructions for transcoding the content from the text, 

_ , audio, or video input mode to the user-specined text, 

altenng p.tch tone, bass or mid-range of the content. audi of video . mode rior (o deliveri the 

8. A system for controlling communications, compnsing: tQ (hc dcstination utilizing a transcodcr 
means for receiving communications content and deter- from lhc group consisting of a text-to-tcxt transcoder, a 

mining a text, audio, or video input mode of the 60 text-to-audio transcoder, a text-to-video transcoder, an 
content; audio-to-text transcoder, an audio -to -audio transcoder, 

means for determining a user-specified text, audio, or and audio-to-video transcoder, a video-to-text 

video output mode for the content for delivering the transcoder, a video-to-audio transcoder, and a video- 

content to a destination; and to-video transcoder. 

means for transcoding the content from the text, audio, or 65 16. The computer program product of claim 15, wherein 
video input mode to the user-specified text, audio, or the instructions for transcoding the content from the text, 
video output mode prior to delivering the content to the audio, or video input mode to the user-specified text, audio, 
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or video output mode prior to delivering the content to the 
destination further comprises: instructions for transcoding 
the content at a system at which the content is initially 
received. 

17. The computer program product of claim 15, wherein 5 
the instructions for transcoding the content from the text, 
audio, or video input mode to the user-specified text, audio, 
or video output mode prior to delivering the content to the 
destination farther comprises: 

instructions for transcoding the content at a system inter- 1( > 
mediate to a system at which the content is initially 
received and a system to which the content is delivered. 

18. The computer program product of claim 15, wherein 
the instructions for transcoding the content from the text, 
audio, or video input mode to the user-specified text, audio, 
or video output mode prior to delivering the content to the 
destination further comprises: 

instructions for transcoding the content at a system to 
which the content is delivered. 

19. The computer program product of claim 15, wherein 
the instructions for transcoding the content from the text, 



audio, or video input mode to the user-specified text, audio, 
or video output mode prior to delivering the content to the 
destination further comprises: 

instructions for creating an avatar for an originator of the 
content by altering identifying characteristics of the 
content. 

20. The computer program product of claim 19, wherein 
the instructions for creating an avatar for an originator of the 
content by altering identifying characteristics of the content 
further comprises: 

instructions for altering speech characteristics of the origi- 
nator. 

21. The computer program product of claim 19, wherein 
35 the instructions for creating an avatar for an originator of the 

content by altering identifying characteristics of the content 
further comprises: 

instructions for altering pitch, tone, bass or mid- range of 
the content. 

20 



09/22/2003, EAST Version: 1.04.0000 



UNITED STATES PATENT AND TRADEMARK OFFICE 

CERTIFICATE OF CORRECTION 



PATENT NO. : 6,453,294 Bl Page 1 of 1 

DATED : September 17, 2002 

INVENTOR(S) :Duttaetal. 
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Line 64, please delete "Dent". 



Signed and Sealed this 
Eleventh Day of March, 2003 




JAMES E. ROGAN 
Director of the United States Patent and Trademark Office 



09/22/2003, EAST Version: 1.04.0000 



